Adaptive Learning as Potential Descent

نویسنده

Mathew Davies

چکیده

Adaptive learning techniques are used in numerous algorithms for classification, prediction and strategic game play1, including boosting. However, these techniques are not unique to computational learning theory. Adaptive learning approaches are also used in the social sciences2, particularly in stochastic game theory. The goal of this paper is to show that there exist significant connections between adaptive learning in contemporary game theory, and adaptive learning in computational learning theory. For instance, the GRL model of adaptive learning for binary choice games [5], a particular case of which is the Roth-Erev (RE) stochastic learning model, is related to Hart and Mas-Collel’s regret-matching algorithm for finding correlated equilibria [10]. Both algorithms (along with several other important learning procedures) are special cases of the potential-descent framework of Cesa-Bianchi and Lugosi [3]. This framework is important for at least two reasons. First, it permits the generalization of a large number of important adaptive algorithms in computer science as well as in game theory. Second, it gives a new theoretical basis for the derivation of bounds on loss and convergence which can in some cases be applied to learning models in the social sciences, as we will show with the RE model. The connections between adaptive learning in game theory and computer science can be seen as instances of the relationship between artificial intelligence and game theory discussed by Tennenholtz [17]. In particular, Tennenholtz cites three fundamental issues of relevance in both fields: reasoning and rationality in distributed environments, learning in uncertain

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing stable neural identifier based on Lyapunov method

The stability of learning rate in neural network identifiers and controllers is one of the challenging issues which attracts great interest from researchers of neural networks. This paper suggests adaptive gradient descent algorithm with stable learning laws for modified dynamic neural network (MDNN) and studies the stability of this algorithm. Also, stable learning algorithm for parameters of ...

متن کامل

A survey of Algorithms and Analysis for Adaptive Online Learning

We present tools for the analysis of Follow-The-Regularized-Leader (FTRL), Dual Averaging, and Mirror Descent algorithms when the regularizer (equivalently, proxfunction or learning rate schedule) is chosen adaptively based on the data. Adaptivity can be used to prove regret bounds that hold on every round, and also allows for data-dependent regret bounds as in AdaGrad-style algorithms (e.g., O...

متن کامل

Analysis Techniques for Adaptive Online Learning

متن کامل

Evaluation of gradient descent learning algorithms with adaptive and local learning rate for recognising hand-written numerals

Gradient descent learning algorithms, namely Back Propagation (BP), can significantly increase the classification performance of Multi Layer Perceptrons adopting a local and adaptive learning rate management approach. In this paper, we present the comparison of the performance on hand-written characters classification of two BP algorithms, implementing fixed and adaptive learning rate. The resu...

متن کامل

Position Control of a Pulse Width Modulated Pneumatic Systems: an Experimental Comparison

In this study, a new adaptive controller is proposed for position control of pneumatic systems. Difficulties associated with the mathematical model of the system in addition to the instability caused by Pulse Width Modulation (PWM) in the learning-based controllers using gradient descent, motivate the development of a new approach for PWM pneumatics. In this study, two modified Feedback Error L...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Adaptive Learning as Potential Descent

نویسنده

چکیده

منابع مشابه

Designing stable neural identifier based on Lyapunov method

A survey of Algorithms and Analysis for Adaptive Online Learning

Analysis Techniques for Adaptive Online Learning

Evaluation of gradient descent learning algorithms with adaptive and local learning rate for recognising hand-written numerals

Position Control of a Pulse Width Modulated Pneumatic Systems: an Experimental Comparison

عنوان ژورنال:

اشتراک گذاری